Collecting Vulnerable Source Code from Open-Source Repositories for Dataset Generation
نویسندگان
چکیده
منابع مشابه
Analysis of Source Code Repositories
Source code repositories are designed to store a huge amount of source code. They also collect indirectly information useful to analyze the development process. Usually, the last set of data is not used at all due to the lack of specialized tools to collect and analyze such data. This paper presents the early stages of a tool designed to perform acquisition and analysis of data stored in source...
متن کاملQuality Classifiers for Open Source Software Repositories
Open Source Software (OSS) often relies on large repositories, like SourceForge, for initial incubation. The OSS repositories offer a large variety of meta-data providing interesting information about projects and their success. In this paper we propose a data mining approach for training classifiers on the OSS metadata provided by such data repositories. The classifiers learn to predict the su...
متن کاملAn Empirical Analysis of the Open Source Development Process Based on Mining of Source Code Repositories
This paper presents an empirical analysis of the Open Source development process from the point of view of the involvement of the developers in the production process. The study focuses on how developers contribute to projects in terms of involvement, size and kind of their contribution. Data have been collected from 53 Open Source projects and target application domains include different areas...
متن کاملSource Code Repositories and Agile Methods
Source repositories are a promising database of information about software projects. This paper proposes a tool to extract and summarize information from CVS logs in order to identify whether there are differences in the development approach of Agile and non-Agile teams. The tool aims to improve empirical investigation of the Agile Methods (AMs) without affecting the way developers write code. ...
متن کاملModel-Based Mining of Source Code Repositories
The Mining Software Repositories (MSR) field analyzes the rich data available in source code repositories (SCR) to uncover interesting and actionable information about software system evolution. Major obstacles in MSR are the heterogeneity of software projects and the amount of data that is processed. Model-driven software engineering (MDSE) can deal with heterogeneity by abstraction as its cor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied Sciences
سال: 2020
ISSN: 2076-3417
DOI: 10.3390/app10041270